Optimizing Directed Acyclic Graphs via Simulated Annealing for Reconstructing Human Segmental Duplications

Author

  • Borislav H. Hristov

Abstract

Segmental duplications are relatively long, nearly identical regions that are prevalent in mammalian genomes, and they are successfully modeled by directed acyclic graphs. Reconstructing the evolutionary history of these genomic regions is a non-trivial but important task, because segmental duplications harbor recent primate-specific and human-specific innovations and also mediate copy number variation within the human population. Using novel models derived by Kahn and Raphael, we formalize this reconstruction task as an optimization problem on the space of directed acyclic graphs. We employ a simulated annealing heuristic and describe an efficient way to apply the technique to this optimization problem in general. We apply the heuristic to both maximum parsimony and maximum likelihood evolutionary models. We use these models to analyze segmental duplications in the human genome and reveal subtle relationships between these duplication blocks.
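To make the idea of simulated annealing over the space of directed acyclic graphs concrete, the following is a minimal sketch under stated assumptions, not the method of the paper. The function names (`creates_cycle`, `propose`, `anneal`, `toy_cost`), the move set (add, delete, or reverse a single edge), the geometric cooling schedule, and the toy cost function are all illustrative choices. In particular, `toy_cost` merely counts edge differences from a made-up target graph and stands in for the parsimony or likelihood score, which the abstract does not specify.

```python
import math
import random


def creates_cycle(edges, u, v):
    """Return True if adding edge u -> v would close a cycle (v already reaches u)."""
    stack, seen = [v], set()
    while stack:
        node = stack.pop()
        if node == u:
            return True
        if node in seen:
            continue
        seen.add(node)
        stack.extend(w for (x, w) in edges if x == node)
    return False


def propose(edges, n, rng):
    """Propose a neighbouring DAG by adding, deleting, or reversing one edge.

    Returns None when the move would create a cycle (assumed move set).
    """
    u, v = rng.sample(range(n), 2)
    new = set(edges)
    if (u, v) in new:
        new.remove((u, v))
        if rng.random() < 0.5:  # reverse the edge instead of deleting it
            if creates_cycle(new, v, u):
                return None
            new.add((v, u))
    else:
        if creates_cycle(new, u, v):
            return None
        new.add((u, v))
    return new


def anneal(n, cost, steps=5000, t0=1.0, alpha=0.999, seed=0):
    """Simulated annealing over DAGs on n nodes; returns (best_edges, best_cost)."""
    rng = random.Random(seed)
    current = set()  # start from the empty graph, which is trivially acyclic
    cur_cost = cost(current)
    best, best_cost = set(current), cur_cost
    t = t0
    for _ in range(steps):
        cand = propose(current, n, rng)
        if cand is not None:
            delta = cost(cand) - cur_cost
            # Metropolis rule: always accept improvements, sometimes accept worse moves.
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                current, cur_cost = cand, cur_cost + delta
                if cur_cost < best_cost:
                    best, best_cost = set(current), cur_cost
        t *= alpha  # geometric cooling (assumed schedule)
    return best, best_cost


# Hypothetical toy objective: number of edges that differ from a made-up target DAG.
TARGET = {(0, 1), (1, 2), (0, 3)}


def toy_cost(edges):
    return len(edges ^ TARGET)


if __name__ == "__main__":
    best, best_cost = anneal(4, toy_cost)
    print(sorted(best), best_cost)
```

Rejecting cycle-creating moves inside `propose` keeps every visited state a valid DAG, so the cost function never has to handle cyclic graphs. A real application would replace `toy_cost` with the chosen evolutionary score and would likely tune the schedule parameters `t0` and `alpha`.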


Related articles

Parsimony and likelihood reconstruction of human segmental duplications

MOTIVATION Segmental duplications > 1 kb in length with ≥ 90% sequence identity between copies comprise nearly 5% of the human genome. They are frequently found in large, contiguous regions known as duplication blocks that can contain mosaic patterns of thousands of segmental duplications. Reconstructing the evolutionary history of these complex genomic regions is a non-trivial, but importan...


DAGchainer: a tool for mining segmental genome duplications and synteny

SUMMARY Given the positions of protein-coding genes along genomic sequence and probability values for protein alignments between genes, DAGchainer identifies chains of gene pairs sharing conserved order between genomic regions, by identifying paths through a directed acyclic graph (DAG). These chains of collinear gene pairs can represent segmentally duplicated regions and genes within a single ...


Automated memory-aware application distribution for Multi-processor System-on-Chips

Mapping of applications on a Multiprocessor System-on-Chip (MP-SoC) is a crucial step to optimize performance, energy and memory constraints at the same time. The problem is formulated as finding solutions to a cost function of the algorithm performing mapping and scheduling under strict constraints. Our solution is based on simultaneous optimization of execution time and memory consumption whe...


Using Simulated Annealing for Hard Real-Time Design-to-Time Scheduling

We present a two-part scheduling algorithm based on simulated annealing and dynamical dispatching for the design-to-time scheduling of hard real-time applications. A decision-DAG (directed acyclic graph) is calculated and optimized offline using simulated annealing and then interpreted during runtime. The underlying system model consists of non-preemptable, periodic tasks with release times an...


Learning Inclusion-Optimal Chordal Graphs

Chordal graphs can be used to encode dependency models that are representable by both directed acyclic and undirected graphs. This paper discusses a very simple and efficient algorithm to learn the chordal structure of a probabilistic model from data. The algorithm is a greedy hillclimbing search algorithm that uses the inclusion boundary neighborhood over chordal graphs. In the limit of a larg...




Publication date: 2010